Building Knowledge-bases from the Web

نویسنده

  • Srinivasan H. Sengamedu
چکیده

The web is a vast repository of information. Most of the information on the web is meant for human consumption. Extracting structured information from the web can enable several applications like advanced ranking, semantic search, etc. In this talk, we first list different types of content available on the web, survey known techniques for extracting information from them, present the architecture of Vertex information extraction system developed at Yahoo, and discuss in detail a new technique for information extraction leveraging content redundancy.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

KnowNet: A Proposal for Building Highly Connected and Dense Knowledge Bases from the Web

This paper presents a new fully automatic method for building highly dense and accurate knowledge bases from existing semantic resources. Basically, the method uses a wide-coverage and accurate knowledge-based Word Sense Disambiguation algorithm to assign the most appropriate senses to large sets of topically related words acquired from the web. KnowNet, the resulting knowledge-base which conne...

متن کامل

Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology

Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...

متن کامل

Building an Ontological Base for Experimental Evaluation of Semantic Web Applications

The increasing number of Semantic Web applications that work with ontologies implies an increased need for building ontological knowledge bases. In order to improve ontologies during their development as well as to allow applications to be experimentally evaluated prior to their complete implementation and deployment, ontology bases must be filled with experimental data (i.e., instance ontologi...

متن کامل

A Joint Foundation for Configuration in the Semantic Web

Product configuration is a major commercial application of knowledge-based systems, and joint configuration by multiple business partners is becoming a key application in today’s highly specialized economy. The required integration of configuration knowledge is a challenging task due to the variety of knowledge representation formalisms used in commercial configurators. Ontology languages such ...

متن کامل

Knowledge Bases in the World Wide Web: A Challenge for Logic Programming

Regarding the World Wide Web, knowledge bases can be categorized between (HTML-)documents and (SQL-)databases. In order to standardize them, the use of Horn logic for Web publications is proposed. The central part outlines the design of a Web search engine for processing distributed Horn-logic knowledge bases. Some of the research issues to be solved are elaborated from the perspective of (para...

متن کامل

Ideal Downward Refinement in the EL Description Logic

With the proliferation of the Semantic Web, there has been a rapidly rising interest in description logics, which form the logical foundation of the W3C standard ontology language OWL. While the number of OWL knowledge bases grows, there is an increasing demand for tools assisting knowledge engineers in building up and maintaining their structure. For this purpose, concept learning algorithms b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010